Missing levels in correlated spectra 
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Abstract 

Complete spectroscopy (measurements of a complete sequence of consecutive 
levels) is often considered as a prerequisite to extract fluctuation properties of 
spectra. It is shown how this goal can be achieved even if only a fraction of levels 
are observed. The case of levels behaving as eigenvalues of random matrices, of 
current interest in nuclear physics, is worked out in detail. 

It is by now well established that, at least with present statistical significance limited 
by the relatively scarce amount of high quality data available, the statistical properties 

1 



of compound nucleus resonances with fixed quantum numbers are consistent with the 
predictions of the Wigner-Dyson random matrix model (Gaussian Orthogonal Ensem- 
ble, G0E)[1]. It is presently understood, not in the form of mathematical theorems but 
rather from numerical experiments and from theoretical insight based on semiclassical 
approaches, that the origin of this (universal) behavior has to be found in the chaotic 
character of the underlying classical motion [2]. Analyses have been mainly performed 
for two degrees of freedom systems (one particle in a two-dimensional box) , which can 
be also experimentally exemplified in studying resonances of microwave cavities with 
adequate boundaries or of quartz or metallic blocks (elastodynamics) (see, for instance, 
contributions by A. Richter, H. J. Stockmann and C. Ellegaard[3]). 

Efforts have also been devoted to find out from which excitation energy on, starting 
from the ground state (where, by analyzing the systematics of ground state energies, 
evidence for the coexistence between regular and chaotic motion has been given [4]) 
the 'random matrix theory' behavior holds[5]. This is a difficult task because, below 
neutron threshold, one can hardly be sure that all levels are detected. Work is also 
currently performed to investigate the statistical properties of the fine structure of 
analog or other giant resonant states, at excitation energies which may be below or far 
above the neutron threshold [6]. 

At different degrees, most of these studies must face the difficulty of establishing 
the extent to which the sequences of energies analyzed are complete ( no missing levels) 
as well as how the missing levels may distort the statistical properties of the otherwise 
ideal (complete) sequence. It is the purpose of this letter to present some results which 
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may be useful in this context. 

The most ambitious goal may be stated as the one of detecting the location of, say, 
one missing level on an otherwise complete sequence. Dyson, in a recent review[7] , uses 
information theory concepts and argues that correlations in a sequence may provide 
the necessary redundancy from which error correcting codes can be constructed. At 
one extreme where no correlations and therefore no redundancy are present (Poisso- 
nian sequence), there is no possibility of detecting one missing level. At the other 
extreme, a sequence of equally spaced levels (picket fence ), there is a maximum re- 
dundancy and a missed level can be obviously detected as a hole in the spectrum. 
Eigenvalues of random matrices, which exhibit characteristic correlations, correspond 
to an intermediate situation between these two extremes. The attempts to locate in 
the last case a single missed level have remained unsuccessful so far. However it should 
be mentioned that for two-dimensional chaotic systems where, besides correlations of 
the order of one mean spacing as described by random matrices, the presence and the 
role of long range correlations governed by the shortest periodic orbits and reflected in 
Weyl's law describing the average spectral density, is well understood. It is then pos- 
sible to approximately locate, from the study of spectral fluctuations, a single missed 
level[8]. 

Here we address a less ambitious question, namely to study how the spectral fluc- 
tuations of an ideally complete sequence are affected when a fraction / (0 < / < 1) 
of levels are detected. It will be assumed that the sequence is infinite and stationary 
and that the levels are dropped at random from the complete sequence (random sam- 
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pling). The last assumption is presumably justified when observing compound nucleus 
resonances with limited detection sensitivity. Indeed, in random matrix theory level 
positions and widths are uncorrelated, a feature which is confirmed by the analysis of 
experimental data. 

We start considering the n-point correlation functions Rn{Ei,E2,...,En) that give 
the joint probability density of finding levels located around each of the values £'1, £'2, £, 
For the incomplete sequence these functions keep their form and they are only reduced 
by a factor /" since / is the probability of observing one given level. We can therefore 
write the relation 



where small cases denote the actual observed quantities and capital letters those of 
the complete spectrum. We shall keep this notation in what follows, namely use the 
same letter, capital or small case, to refer to corresponding quantities for complete or 
incomplete sequences, respectively. For the spectral density (1) gives 



and, from (2), the unfolding procedure that maps the levels onto a new spectrum whose 
density is equal to one yields the relation 





(2) 




(3) 
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where k = 1,2, ... labels the levels which are observed. The normalization will always 
be kept the same (the mean spacing < s >— 1 for the complete as for incomplete 
sequences) . This is in contrast to some of the expressions given in [9] . 

One is usually interested in the n-point cluster functions F„ (Xi, X2, X„) from 
which lower order correlations are subtracted. Prom their definition [10] and (1) we 
deduce 



These important equations show that the correlation functions keep their form, the 
only modification is a rescaling of variables. Thus, if the fraction / is known, the 
correlation functions can be reconstructed from the partially observed spectrum. 

It is well known that the number variance (L) (variance of the number of levels 
contained in an interval of length L) and the Dyson-Mehta A-statistics (least square 
deviation, in an interval of length L, of the actual staircase counting function from its 
linear approximation) are related to the two-point cluster function through [10] 







A(L) 



L 



[ {L-xf (2L2 - 9xL - 2,x^) Y2 {x) dx 
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Prom (4) it follows immediately that 
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and 



a' (L) = (1 - /) L + fY? 



(6) 



^(^) = (l-/)^ + /'A(y). (7) 

Notice the appearance of a linear term in both equations, if / < 1. 

Another set of statistical measures are the E{n, s) functions that give the proba- 
bilities of finding n levels, with n = 0, 1, 2..., inside an interval of length s. If for the 
complete spectrum they are given by E{n, s), then their expressions if only a fraction 
/ of levels is observed is 

<».^) = E;j(Fr^/"(i-/)'-"^(*.f)- («) 

This follows from the fact that 1 — / is the probability that one level is missed, / 
that it is not and counts the number of ways k — n points can be removed 

from k points. Prom e(n, s), the spacing distributions p{n,s), the probability density 
of finding n levels (n = 0, 1,2...) between two given levels separated by a distance s, 
can be derived using the relation 

k=0 

one obtains 
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^M = j:^jli-^r(i-fr'p(k,j). (10) 



k=n 



For the nearest neighbor distribution (NND), i.e. p{s) — p{0,s) (10) becomes 

oo 

p{s) = J2i^-f)'P{k,j). (11) 

k=Q ■' 

The equations derived above imply that the general relation 

oo 

Z/2 (x) = 1- ^p{n,x) 

n=0 

is fulfilled, as it should. 

Expression (11) was first proposed as an anzatz in ref. [11], and in ref. [9] it 
has been shown that the coefficients / (1 — /)*^ maximize the Shannon entropy with 
constraints appropriately defined. Here it is obtained in a direct way. 

We first apply, as a check, the previous equations to a Poissonian spectrum of 
uncorrelated levels in which case they must remain invariant (they should not depend 
on /). If, for example, we substitute in Eq. (8) the expression 



s 

E{n,s)^-eM-s) (12) 
for the ^'-functions of a Poisson sequence, one obtains 
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It follows that any other measure derived from the E{n,s) functions will also remain 
invariant. Since Y2 {Xi, X2, Xn) — 0, the n-point cluster functions of the incomplete 
sequence also vanish and all other measures derived from them also remain invariant. 

Before considering GOE spectra with a fraction of observed levels, let us first recall 
some of the GOE expressions (/ = 1) to be used[10]. The two-point cluster function 
reads 



Y2 {x) = 



sminx) 



nx 



cos(7ra;) sin(7ra;) 



nx 



(TTXl 



(14) 



- [Si (ttx) — 7re (x) 

where Si (ttx) is the sine-integral and e (x) = ±| if x > or x < and e (0) = if 
X — 0. The number variance S^(L) is given by 



where 7 = 0.577... is Euler's constant and the A-statistics is given by 



(15) 



A(L) 



ln(27rL) + j + 



-i + 0(L-') 



(16) 



The NND distribution is well approximated by the Wigner surmise (2x2 matrices) 



P(.) = P(0,.) = |.exp(-p). (17) 

The next to nearest neighbor distribution P(l, s) is given by the NND of the symplectic 
ensemble which again is well approximated by the 2x2 matrix result 
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(care has been take to insure (s) = 2). The higher {k — 2, 3, ...) spacing distributions 
P{k, s) are well approximated by their (Gaussian) asymptotic form, centered at /c + 1 
and variances V {k) given by[12] 

V^{k):^T?{L = k)-\. (19) 

We apply now the above expressions to GOE spectra when only a fraction / of levels 
is observed. On Fig 1 is displayed for different values of / the function Y2, illustrating 
the scahng behavior Eq. (4) using (14). 

By comparing Eq. (6) with (15)and (7) with (16) one immediately sees one major 
effect of the incompleteness of a sequence. Instead of a logarithmic increase of the 
number variance or of the Dyson-Mehta statistic (/ = 1), one has a linear increase for 
/ < 1 of slope (1 — /) or (1 — /) /15 respectively. 

Consider finally p (s), the NND very often discussed in the literature. It is given by 
(11), (17), (18), (19). The range of the argument determines the number of terms to 
be included in (11). The slope at the origin given by the first term in (11) is increased 
by a factor 1// with respect to the GOE value. Let us also mention that, in fact, 
Eq. (11) was already used to analyze high quality data of ^^^U neutron resonances [13] 
and of ^^Ti proton resonances [9]. It was concluded that a fraction of about 10% of the 
resonances is missing in both cases and that the correction is important in investigating 
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the parity dependence of nuclear level densities[14]. 

It is convenient, in particular to exhibit the asymptotic behavior of p (s), to rewrite 
Eq. (11) in a different form. Let us first separate the first K terms in the sum and 
approximate the rest (the infinite sum from k — K to k — oo) by an integral in which 
k is treated as a continuous variable 



h — K 1 /»ryn 77 r -1 

1 . s 



p[s)^ y {l-f)''P{k,-)+r^J^exp 
h f Jk y/2nVW) 

(20) 

A steepest descent approximation to the integral can be worked out by mapping the 
exponent in the integrand onto a parabola as 

F{k) = -k ln(l - /) + ^^(^ -k-ir^ F{ks) + t\ (21) 

where /c^, the stationary point, is the root of the equation F'{k) = 0. This transcendent 
equation can be solved approximately if one considers the variance V'^{k) as a slowly 
varying function of its argument, in which case one can write 



K^j-l + V\j-l)H^-f). (22) 
Replacing then the coefficient — ^===^ by its value at the stationary point, we obtain 



the closed expression 



P{s) = E (1 - ff P{K 7) + \ exp [-F{ks)\ [1 - eri{ts)\ , (23) 
fe=o 
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where 



I -y/F(K)-F(ks) for ks>K 
In the asymptotic region, 4 large, (23) takes the simple expression 




^/F{K) - F{ks) for ks<K 



(24) 



1 



exp - ln(l - /) 



(25) 



1-/ 



illustrating that the spacing distribution approaches a Poisson distribution as / tends 
to zero. We have therefore a family of NND distributions, parametrized by the fraction 
/ of the observed levels, that interpolates between the GOE and the Poisson statistics. 

In Fig 2 comparison is made with numerical simulations. It can be seen that for 
/ = 0.90 the asymptotic regime Eq. (25) is not yet reached in the range considered 
whereas for / = 0.50 it is already reached for s ^ 2. 

In conclusion, we have shown how the problem of missing, at random, a fraction of 
levels of a correlated spectrum can be solved. From the observation of an incomplete 
spectrum the results presented here have a twofold application: i) if the missing fraction 
is known, one can recover the statistical properties of the complete spectrum, ii) if the 
statistical nature of the complete spectrum is known, an estimate of the fraction of 
missed levels can be obtained. 
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Fig. 1 The two-level cluster function Y2 . Full line: Eqs. (4) and (14). Points: 
obtained with numerical simulation for different values of the fraction / of levels ob- 
served. 

Fig. 2 Nearest neighbor spacing distribution p (s) for two values of / . Full line: the- 
ory Eq. (11) or (23). Dashed hne: Wigner surmise Eq. (17). Dotted line: asymptotic 
behavior as given by Eq. (25). Histograms: numerical simulation. 
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